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SUMMARY 

Differentiated cells can be reprogrammed into induced pluripotent stem cells (iPSCs) after overexpressing four transcription factors, of 
v^^hich Oct4 is essential. To elucidate the role of Oct4 during reprogramming, v^^e investigated the immediate transcriptional response 
to inducible Oct4 overexpression in various somatic murine cell types using microarray analysis. By dov^^nregulating somatic-specific 
genes, Oct4 induction influenced each transcriptional program in a unique manner. A significant upregulation of pluripotent markers 
could not be detected. Therefore, OCT4 facilitates reprogramming by interfering v^^ith the somatic transcriptional netv^^ork rather than 
by directly initiating a pluripotent gene-expression program. Finally, Oct4 overexpression upregulated the gene Mgarp in all the analyzed 
cell types. Strikingly, Mgarp expression decreases during the first steps of reprogramming due to a KLF4-dependent inhibition. At later 
stages, OCT4 counteracts the repressive activity of KLF4, thereby enhancing Mgarp expression. We shov\^ that this temporal expression 
pattern is crucial for the efficient generation of iPSCs. 



INTRODUCTION 

Somatic cells can be reprogrammed into induced pluripo- 
tent stem cells (iPSCs) after the forced expression of three 
or four transcription factors: Oct4, Sox2, Klf4, and option- 
ally, Myc (Nakagawa et al., 2008; Takahashi and Yamanaka, 
2006). Oct4 is indispensable for establishing pluripotency 
in the embryo (Nichols et al., 1998) and for maintaining 
pluripotency in mouse embryonic stem cells (ESCs) (Niwa 
et al., 2000). Under physiological conditions, the OCT4 
protein needs to interact with SOX2 for activating most 
of its target genes and for maintaining the self-renev^al of 
ESCs (Boyer et al, 2005; Remenyi et al., 2003; Rodda 
et al., 2005). Nevertheless, overexpression of Oct4 can 
rescue ESC self-renewal in the absence of Sox2 (Masui 
et al., 2007), indicating that Sox2 is not essential for sup- 
porting pluripotency. In addition, Sox2 can be replaced 
by other Sox factors or by transforming growth factor p 
inhibitors in the reprogramming of somatic cells into iPSCs 
(Ichida et al., 2009; Nakagawa et al, 2008). Likewise, Klf4 is 
dispensable for maintaining ESC self-renewal (Jiang et al, 
2008) and for inducing pluripotency (Nakagawa et al, 
2008), as Klf2 and KlfS can replace Klf4 in both functions. 
In addition, Esrrb can also substitute Klf4 in iPSCs gene- 
ration (Feng et al., 2009). Therefore, Oct4 is the only 
transcription factor in the conventional reprogramming 
cocktail that is essential for pluripotency. 

To date, the role of OCT4 in reprogramming has been 
studied only in the context of its interaction with SOX2 
(Buganim et al., 2012; Hansson et al, 2012; Polo et al. 



2012; Sridharan et al, 2009; Stadtfeld et al, 2008). As exog- 
enous Sox2 is not required for inducing pluripotency 
(Ichida et al., 2009; Maherali and Hochedlinger, 2009), 
we decided to investigate the specific effect of OCT4 alone 
in the first steps of reprogramming. To circumvent the 
inevitable heterogeneity generated by viral factor delivery, 
we established a set of different somatic cell types from 
tetracycline-inducible Oct4 transgenic mice. This approach 
facilitates the study of rare events in cell populations that 
simultaneously activate Oct4. Finally, we performed global 
gene-expression profile analyses to identify the early tran- 
scriptional changes caused by ectopic Oct4 expression in 
somatic cells. Overall, our study provides insights into 
the specific OCT4-dependent events that promote the in- 
duction of pluripotency. 



RESULTS 

Generation and Characterization of Different 
Oct4-Inducible Somatic Cell Types 
To analyze the immediate transcriptional response trig- 
gered by Oct4 overexpression in somatic cells, we derived 
mouse embryonic fibroblasts (MEFs), neural stem cells 
(NSCs), and bone marrow cells (BMCs) from mice con- 
taining both a tetracycline transactivator and a tetracy- 
cline-inducible Oct4 transgene and termed these cells 
tetracycline-operon-controlled Oct4-expression cassette 
(TO)-MEFs, TO-NSCs (cell line number 1), and TO-BMCs, 
respectively. In addition, we established an NSC line from 
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mice containing only the tetracycline transactivator 
(CtrlNsc) as a control to detect nonspecific tetracycline 
transactivator activities unrelated to OCT4. Finally, non- 
transgenic MEFs were also used as a control to identify 
effects caused by the tetracycline treatment itself (CtrlMEp) 
(Figure lA). 

Using immunocytochemistry, we assessed the level of 
OCT4 protein expression after 24 hr of doxycycline induc- 
tion in each generated cell type. We counted 98% of 
TO-MEFs to be positive for OCT4 staining, with intensity 
levels comparable to those observed in ESCs (Figure IB). 
In contrast, only 40% of TO-BMCs and 10% of TO-NSCs 
(number 1) exhibited strongly induced OCT4 expression. 
Interestingly, 40% of TO-NSCs (number 1) became OCT4 
positive after treatment with 5-azacytidine (Figure IB), 
suggesting the presence of a DNA-methylation-based 
mechanism for transgene silencing in this cell type. For 
this reason, we decided to exclude TO-NSC line number 
1 from our study. Instead, we transduced Ctrlnsc with a len- 
tiviral vector coding for the same TO cassette that is present 
in the other cell types (Stadtfeld et al, 2008; Figure lA). 
This newly generated TO-NSC line (number 2) could effi- 
ciently induce OCT4 expression in 98% of the cells after 
doxycycline treatment and was thus the only TO-NSC 
line used in the subsequent experiments (Figure IB). 
Next, the induction of exogenous Oct4 expression was 
analyzed in a time course manner by quantitative RT-PCR 
(qRT-PCR). After 6 hr, all cell types exhibited ESC-like 
Oct4 transcript levels (Figure IC). Furthermore, immuno- 
blotting confirmed OCT4 expression at the protein level 
(Figure ID). We also surveyed the onset of OCT4 trans- 
lation in more detail by immunocytochemistry. OCT4 
protein emerged as early as 2 to 3 hr after induction and 
reached saturation levels after approximately 6 hr (Fig- 
ure IE). Consistent with previous observations (Hochedlin- 
ger et al., 2005; Wang et al., 2011), in-vitro-cultured 
somatic cells showed impaired proliferation after Oct4 
overexpression (Figure SI available online). 

To assess whether the Oct4 induction levels are sufficient 
for iPSC generation, we transduced the different cell types 
with retroviral vectors coding for Sox2, Klf4, and Myc. After 
inducing Oct4, we generated Oct4-GFP-positive, ESC-like 
colonies from all somatic cell types and established clonal, 
doxycycline-independent cell lines (Figure 2A). The reprog- 
ramming efficiency and kinetics using the inducible Oct4 
transgene were similar to those achieved using viral deliv- 
ery of Oct4 (data not shown). The generated iPSC lines 
stained positive for the pluripotency markers alkaline 
phosphatase (ALP; Figure 2A), NANOG, and SSEA-1 (Fig- 
ure 2B). As ESCs, iPSCs exhibited unmethylated Oct4 and 
Nanog promoters (Figure 2C). Moreover, all iPSC lines 
were capable of differentiating into all three germ layers 
in vitro and in vivo after embryoid body formation (Fig- 



ure 2D) and blastocyst injection (Figure 2E), respectively. 
Finally, germline contribution was demonstrated by the 
presence of Oct4-GFP-positive germ cells in the fetal gonads 
(Figure 2E). In summary, we have derived three different 
somatic cell types that can efficiently induce Oct4 expres- 
sion to levels supporting iPSC generation. 

Oct4 Induction Affects the Gene-Expression Profile in 
a Cell-type-Specific Manner 

To identify the OCT4 targets during the first steps of 
reprogramming, we collected samples of the different 
Oct4-inducible cells and controls at 0, 6, 12, and 24 hr 
after Oct4 induction with doxycycline treatment. Global 
gene-expression profiles were then recorded by microarray 
analysis. Upregulation of Oct4 expression was clearly 
confirmed in all inducible cell types, but not in the controls 
(Figure 3A). The two corresponding Oct4-detecting array 
probes were not further considered in the subsequent 
analyses. Induced TO-MEF and TO-NSC cultures, which 
express the Oct4 transgene most uniformly, exhibited a 
stronger transcriptional response than TO-BMCs. CtrlMEF 
cells showed hardly any transcriptional response to the 
doxycycline treatment. However, differential expression 
of some genes was observed in the tetracycline-transactiva- 
tor-containing CtrlNsc cells, suggesting the need for a 
more-refined analytic strategy that could precisely distin- 
guish OCT4-dependent from OCT4-independent effects 
(Figure 3A). Therefore, we assigned a score to each probe 
in each cell type. This score is based primarily on the total 
signal change within 24 hr but also incorporates additional 
parameters such as expression changes in the controls, the 
dynamic behavior throughout the time course, and the 
absolute signal intensity as an indicator for noise artifacts. 
Probes with a score higher than 1 or lower than - 1 were 
defined as up- or downregulated, respectively (for details, 
see Experimental Procedures and Figure S2). Surprisingly, 
the genes that were most strongly upregulated differed 
greatly from one cell type to another, indicating that the 
immediate targets of OCT4 are unique to each cell type 
(Figure 3B). Interestingly, fewer than 50% of the genes 
that were upregulated by OCT4 in the different cell types 
are expressed in ESCs (Figure SB) and only a small subset 
(<4%) of these ESC-expressed genes is ESC-specific, i.e., 
not expressed in any other cell type examined (Table SI). 
The genes that were most strongly downregulated by 
OCT4 expression differed among the cell types, and 
many genes were specific to each cell type (Figure 3C). In 
fact, 21.0%, 12.4%, and 70.1% of genes downregulated in 
TO-MEFs, TO-NSCs, and TO-BMCs, respectively, were not 
expressed in any other cell type in our data set, suggesting 
a possible early effect of Oct4 expression in the disruption 
of the respective somatic regulatory program. Surprisingly, 
many downregulated genes are also highly expressed in 
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Figure 1. Generation and Charactenzation of Tetracycline-Inducible 0ct4 Cells and Controls Used for Dynamic Global Gene- 
Expression Analysis 

(A) Fibroblasts (TO-MEFs), neural stem ceLLs (TO-NSCs number 1), and bone marrow ceLLs (TO-BMCs) contain a tetracycline transactivator, a 
tetracycLine-operon 0ct4-expression cassette, and an 0ct4-GFP reporter. NSCs with a tetracycline transactivator and an 0cf4-GFP reporter 
but Lacking a tetracycLine-operon 0ct4 cassette (CtrLNsc) served as a control for nonspecific transactivator activity. WiLd-type C57BL/6 
fibroblasts were used as a control for drug response artifacts (CtrLMEp)- TO-NSCs number 2 were generated after transducing CtrLNsc with a 
LentiviraL vector coding for a tetracycLine-operon 0ct4. Transcriptional effects of 0ct4 expression were investigated by microarray analysis 
at different time points after doxycycLine induction. 

(B) Immunological staining for 0CT4 protein (red) in the different cell types after 24 hr of doxycycline induction. Nuclei were counter- 
stained with Hoechst (blue). The scale bars represent 50 |im. 

(C) Expression of 0ct4 mRNA (qRT-PCR), normalized to an embryonic stem cell (ESC) control. The error bars represent SE of calculations 
based on ACt values obtained from two different housekeeping genes: Gapdh and Actb (n = 1). 

(D) 0CT4 protein immunoblot of induced and uninduced samples. GAPDH was used as a loading control. 

(E) 0CT4 protein translation time course after doxycycline induction. 0CT4 protein and nuclei are shown in red and blue, respectively. The 
scale bars represent 50 |im. 

Abbreviations: dpc, days postcoitum; dpp, days postpartum; Dox, doxycycline; AzaC, 5-azacytidine; TO, tetracycline-inducible. See also 
Figure SI as well as Tables S5 and S6. 
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Figure 2. Characterization of iPSCs Generated from Tetracycline-Inducible 0ct4 Cells 

(A) CLonaL induced pLuripotent stem ceLL (iPSC) Lines generated by tetracycLine-induced 0ct4 expression in combination with viraLLy 
delivered Sox2, Klf4, and Myc. Phase-contrast overlay with 0ct4-GFP reporter fluorescence and ALP staining. The scale bars represent 50 |im. 

(B) Immunological stainings for NANOG and SSEA-1 (red), overlaid with nuclear Hoechst staining (blue). The scale bars represent 50 |im. 

(C) DNA methylation of 0ct4 and Nanog promoters in iPSCs and controls. Empty and full circles represent unmethylated and methylated 
CpG cytosine residues, respectively. 

(D) Immunocytochemistry for specific marker proteins (red) showed in vitro differentiation of iPSC clones into all three germ layers. Nuclei 
were counterstained with Hoechst (blue). The scale bars represent 50 |im. 

(E) PCR genotyping for the GPP transgene demonstrated that iPSCs could contribute to organs representative of all germ layers in 14.5 dpc 
chimeric embryos. Germline contribution was proven by the presence of 0cf4-GFP-positive cells in the fetal gonads. The black scale bars 
represent 5 mm. The white scale bars represent 250 |im. 

Abbreviations: TUBB3, p-III-tubulin (ectoderm; neurons); ACTA2, a-II-actin (mesoderm; smooth muscle cells); S0X17, SRY-box- 
containing 17 (definitive endoderm); br, brain; he, heart; li, liver. See also Tables S5 and S6. 
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ESCs (55.4%, 79.1%, and 16.6%inTO-MEFs,TO-NSCs, and 
TO-BMCs, respectively; Figure 3C). To further investigate 
whether the OCT4 target genes reflect common cellular 
processes, we performed gene ontology analyses (Tables 
S2-S4). The few statistically enriched ontology terms ap- 
peared to be remarkably unique for each cell type. Strik- 
ingly, no ontology terms for upregulated genes were found, 
with the exception of the unexpected term ''neuron projec- 
tion" in TO-MEFs. In contrast, the downregulated genes 
were enriched for ontologies related to the specific cell 
type. Moreover, TO-NSCs showed downregulation of genes 
related to the ''mitotic cell cycle" and other similar terms. 
Several of these genes were also downregulated in induced 
TO-MEFs. This result correlates with the observed prolifer- 
ation impairment caused by Oct4 overexpression in both 
cell types (Figure SI). 

An unsupervised hierarchical clustering (Figure S3A) and 
a principal-component analysis (PC A) (Figure 3D) showed 
all the time points for a specific cell type clustering 
together, indicating that the OCT4-induced changes in 
global gene expression were rather small, compared to 
the overall differences distinguishing one cell type from 
the other. However, TO-MEFs and TO-NSCs after 24 hr of 
Oct4 induction were more different from their uninduced 
counterparts than were CtrlMEF and CtrNsc cells, respec- 
tively. We also compared the numbers of differentially ex- 
pressed genes between each pair of samples (Figure S3B). 
Importantly, none of the cell types became more similar 
to ESCs upon Oct4 induction, as assessed by gene expres- 
sion. PCA confirmed that the transcriptional program of 
the different cell types shifted away from that of ESCs 
and revealed that the direction of gene-expression changes 
was unique to each cell type (Figure 3D). Overall, these 
results suggest that, early in reprogramming, Oct4 ex- 
pression interferes with the somatic cell transcriptional 
network in a cell-type-specific manner, but does not 
directly induce a pluripotent cell fate or a gene-expression 
signature common to all cell types examined. 

Mgarp Is One of the Few Genes that Are Commonly 
Upregulated in All Cell Types 

To identify common targets of OCT4, we directly compared 
the sets of up- and downregulated genes for each individual 
cell type but only detected a few overlaps (Figures 4A and 
4B). From a total of 1,441 upregulated probes, 102 (7.1%) 
were upregulated in two different cell types, and only 
four genes (0.3%), namely Parml, Mgarp, Slc24a3, and 
TmemSS, were commonly upregulated in all three different 
cell types (Figure 4A). The intersections within the downre- 
gulated genes were even smaller, as 107 (6.7%) of 1,587 
probes were downregulated in two different cell types. 
Not a single gene was commonly downregulated in TO- 
NSCs, TO-MEFs, and TO-BMCs (Figure 4B). The vast major- 



ity of both up- and downregulated genes thus represent 
unique effects of Oct4 expression that occur only in a spe- 
cific cellular environment. Of note, the nonuniform induc- 
tion pattern observed in TO-BMCs might contribute to this 
finding, as uninduced cells will dilute the measurable gene- 
expression changes. However, TO-MEFs and TO-NSCs, 
which exhibit homogeneous Oct4 induction, have in com- 
mon only 6.5% of their up- and downregulated genes. We 
also identified 71 genes that were oppositely regulated by 
Oct4 expression, i.e., they were upregulated in at least one 
cell type while being downregulated in at least another 
(Figure 4C). Furthermore, 83.2% of the commonly down- 
regulated genes are expressed in ESCs. In contrast, only 
58.7% of the commonly upregulated genes are expressed 
in ESCs and as few as 5.8% are ESC-specific (Figure 4C). 
Surprisingly, these ESC-specific genes do not include any 
key pluripotency markers (Table SI). Finally, we selected 
the four commonly upregulated as well as two commonly 
downregulated and two oppositely regulated genes (Fig- 
ure 5 A) and validated their dynamic gene-expression 
changes using qRT-PCR (Figure 5B). Our results indicate 
that ectopic Oct4 expression does not activate a common 
transcriptional program in different somatic cell environ- 
ments and show that only four genes are commonly upre- 
gulated in all the cell types examined. 

OCT4-Dependent Upregulation of Mgarp Is 
Counteracted by KLF4 to Ensure Successful 
Reprogramming 

We then investigated Parml and Mgarp, the two genes most 
strongly upregulated by OCT4 in all the three cell types 
examined. Overexpression and knockdown of Parml had 
no effect on iPSC generation (data not shown), and there 
is no indication of an association between this gene and 
pluripotency or reprogramming. However, Mgarp (mito- 
chondria-localized glutamic acid rich protein) is present 
in both ESCs and epiblast stem cells. Mgarp expression 
was not detected in early primordial germ cells but was 
induced to ESC-like levels in male germ cells at 13.5 days 
postcoitum (dpc) and maintained in spermatogonial stem 
cells after birth. In contrast, female germ cells did not ex- 
press Mgarp (Figure S4). Furthermore, Mgarp is transcribed 
in several organs such as ovary, testis, the visual system, 
and the brain (Kinouchi et al., 2006; Li et al., 2009; Matsu- 
moto et al, 2009; Qi et al, 2011). 

To investigate a possible role of Mgarp in reprogramming, 
we generated a set of viral vectors to increase or decrease the 
Mgarp expression level in a constitutive or tetracycline- 
inducible manner (Figure S5). MEFs, NSCs, and BMCs 
bearing an Oct4-GVV reporter transgene were transduced 
with viruses coding for mouse Oct4, Sox2, Klf4, and Myc 
plus Mgarp or small hairpin RNA against Mgarp (shMgarp). 
Three weeks later, we assessed for ALP activity (Figure 6A) 
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and counted the number of Oct4-GFP-positive iPSC col- 
onies (Figure 6B). Strikingly, Mgarp constitutive overexpres- 
sion caused a dramatic decline in colony numbers to fewer 
than 10% of the respective control (Figure 6B). This indi- 
cates that, although Mgarp is expressed in pluripotent 
stem cells and upregulated by the key reprogramming fac- 
tor OCT4, high Mgarp levels are detrimental to the reprog- 
ramming process. Unexpectedly, constitutive knockdown 
of Mgarp did not have the opposite effect — ^but it also signif- 
icantly reduced the number of iPSC colonies in all the 
analyzed cell types (Figure 6B). To better understand this 
ambivalent role of Mgarp in reprogramming, we analyzed 
the effect of the reprogramming factors OCT4, SOX2, 
KLF4, and MYC, individually and in combination, on the 
transcriptional activation of Mgarp. Consistent with our 
previous observations, OCT4 strongly and stably induced 
the upregulation of Mgarp expression. SOX2 and MYC 
did not have any effect on Mgarp expression. In contrast, 
KLF4 strongly inhibited the transcription of Mgarp (Fig- 
ure 6C). A very interesting pattern of expression was 
observed after the combined overexpression of Oct4 and 
Klf4 alone or together with Sox2 and Myc. In fact, Mgarp 
expression levels first decreased until day 6 after transduc- 
tion but then started to increase. This temporal expression 
pattern was maintained at lowered levels in the presence 
of two different constitutive shMgarp constructs and was 
completely disrupted by constitutive Mgarp overexpression 
(Figure 6C). Based on these results, we hypothesized that 
continual high levels of Mgarp {Oct4, Sox2, Klf4, and Myc 
[OSKM] + Mgarp) as well as constant low levels of 
Mgarp (OSKM + shMgarp) interfere with the initial 
Mgarp downregulation and the subsequent Mgarp up- 
regulation steps induced by KLF4 and OCT4, respectively 
(Figure 6C). To directly test this hypothesis, we transduced 
MEFs with Oct4, Sox2, Klf4, and Myc plus tetracycline- 
inducible vectors for Mgarp overexpression or knockdown. 
The expression or inhibition was induced at different 
time points (Figure 6D). Mgarp overexpression within the 
first 3 days of the reprogramming process significantly 
reduced colony numbers, as it interfered with the 



initial step in which the level of Mgarp needs to decrease. 
Consistently, Mgarp induction at later stages did not affect 
the reprogramming efficiency. As expected, Mgarp knock- 
down, which interferes with late steps, was detrimental 
for the induction of pluripotency independently of the 
time of onset (Figure 6D). Therefore, our results demon- 
strate that a specific two-step time course of Mgarp expres- 
sion is required for reprogramming and that disturbing 
either of the two steps impairs iPSC generation. 

Next, we investigated whether OCT4 and KLF4 regulate 
Mgarp by directly binding to regulatory regions within its 
genomic locus. OCT4 protein has previously been shown 
to bind near the Mgarp gene in mouse ESCs using a 
chromatin immunoprecipitation (ChlP)-chip approach 
(Mathur et al., 2008). Indeed, we identified a putative 
OCT4 binding motif (Chen et al, 2008) in the first intron 
of Mgarp (Figure 7A). Furthermore, the Mgarp promoter fea- 
tures a sequence with a high similarity to the consensus 
KLF4 binding motif (Chen et al., 2008; Figure 7A). To assess 
whether OCT4 and KLF4 bind to the Mgarp locus, we 
designed a set of primers that span approximately 5 kb 
up- and downstream of the Mgarp transcription start site 
(Figure 7k) and performed ChlP-quantitative PCR (qPCR) 
in MEFs overexpressing Oct4 or Klf4. We detected an 
enrichment of both OCT4 (Figure 7B) and KLF4 (Figure 7C) 
in proximity of the identified candidate binding sites. 
Interspecies genome comparisons further supported the 
notion that Mgarp is a direct target gene of OCT4 (Fig- 
ure 7D) and KLF4 (Figure 7E), as both binding motifs are 
well conserved across a variety of mammalian species. 

The MGARP protein is localized in the mitochondria 
and was reported to alter mitochondrial distribution and 
transport (Li et al., 2009). Interestingly, a crucial step in 
reprogramming is the transition from a somatic-oxidative 
to a pluripotent-glycolytic metabolism by remodeling the 
mitochondrial network (Folmes et al., 2011; Kelly et al., 
2013). To test whether the role of Mgarp in reprogramming 
is related to mitochondria, we overexpressed Mgarp in 
MEFs. However, we did not observe any effects on mito- 
chondria numbers or respiratory activity (Figure S6). 



Figure 3. Global Gene-Expression Analysis after Induction of 0ct4 Alone 

(A) Scatter plots comparing expression profile before and after 0ct4 overexpression for each ceLLtype examined (Log2 scale). Probes with 
increased and decreased intensity upon 0ct4 induction are depicted in red and green, respectively. The blue area contains equally ex- 
pressed probes (<2-fold difference). The two 0ct4 probes are highlighted. 

(B) Identification of the 0CT4-induced upregulated genes. Heatmaps show the temporal changes in signal intensities for the ten genes 
presenting the highest score value. The expression levels in an ESC control are shown for comparison. 

(C) Identification of 0CT4-induced downregulated genes. Heatmaps show the temporal changes in signal intensities for the ten genes 
presenting the lowest score. 

(D) Principal-component analysis (PCA) of global transcriptome data. The three principalcomponents that contribute moststrongly to the 
variability among the individual samples are shown. The numbers inside the circles denote hours after 0ct4 induction. P.C., principal 
component. 

See also Figures S2, S3, and Tables S1-S4. 
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Figure 4. Shared and Unique 0ct4 Targets 

(A) Venn diagram depicting overlaps of upreguLated gene sets among the different ceLL types after 0ct4 induction. 

(B) Venn diagram depicting overlaps of downregulated gene sets. 

(C) Common up, genes with highest scores that are upreguLated in at Least two different ceLL types. Common down, genes with Lowest scores 
that are downreguLated in at Least two different ceLL types. Contrary, genes that are oppositeLy expressed among at Least two different ceLL 
types. Unique up, genes with highest scores that are upreguLated in onLy one ceLL type. Unique down, genes with Lowest scores that are 
downreguLated in onLy one ceLL type. 
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Figure 5. Validation of 0CT4 Target Genes 
Identified in the Microarray Analysis 

(A) The only four commonLy upreguLated 
{Parml, Mgarp, Slc24a3, and Tmem53) and 
two exampLes of commonLy downreguLated 
{AdoraZb and Kank4) as weLL as contrarUy 
regulated genes {Gap43 and Gadd45 g) were 
chosen. Microarray signal intensities for aLL 
samples and controls at different time 
points after induction are shown as heat- 
maps. Arrows indicate in which ceLL types 
the criteria for up- or down regulation were 
fuLfiLLed. 

(B) Dynamic gene-expression analysis of 
the same genes using qRT-PCR, normalized 
to the respective uninduced ceLLs. The error 
bars represent SE of calculations based on 
ACt values obtained from two different 
housekeeping genes: Gapdh and Actb (n = 1). 
See also Table S5. 
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Finally, Mgarp, which is expressed in ESCs and required 
to establish pluripotency, does not play a role in ESC self- 
renewal, as Mgarp knockdown did not affect ESC 
morphology or proliferation (Figure S7). 

In summary, our data suggest that the initial decrease in 
Mgarp expression is a consequence of direct inhibition by 
KLF4 and that the subsequent increase in Mgarp expression 
is a direct effect of activation by OCT4. Most importantly, 
iPSC generation is severely impaired by interference with 
either the initial Mgarp decrease (OSKM + Mgarp) or the sub- 



sequent Mgarp increase (OSKM + shMgarp). Therefore, the 
counteracting regulatory activities of OCT4 and KLF4 
ensure the specific Mgarp time pattern of expression that 
is essential to induce pluripotency. 

DISCUSSION 

In this study, we derived different Oct4-inducible somatic 
cell types to identify early transcriptional events induced 
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Figure 6. Different Mgarp Expression Levels Required at Early and Late Stages during Reprogramming of Somatic Cells into iPSCs 
Are Competitively Regulated by 0CT4 and KLF4 

(A) Reprogramming of MEFs, NSCs, and BMCs into iPSCs using different factor combinations. Representative weLLs were stained for ALP 
activity (red) on day 21 after infection. 

(B) 0ct4-GFP-positiveiPSC colonies were counted on day 21 after infection with different factor combinations. The error bars represent SD 



of the mean (biological replicates, n = 5). p values refer to a two-tailed, unpaired Student's t test: *p < 10 ; **p < 10 



'^p<10" 



(C) Monitoring of Mgarp expression levels (qRT-PCR) during a 15-day period after infection of MEFs with different combinations of genes, 
normalized to ESCs. The error bars represent SD of the mean (biological replicates, n = 4). 

(D) 0ct4-GFP-positive iPSC generated after infection of MEFs with four factors plus tetracycline-inducible (Tet-) vectors for Mgarp 
overexpression or knockdown. The expression or inhibition was induced at different time points, as indicated. The error bars represent SD 



of the mean (biological replicates, n = 5). p values referto a two-tailed, unpaired Student's t test: *p<10~ ; **p<10" 
not significant (p > 0.1). Abbreviations: shRNA, small hairpin RNA; OSKM, 0ct4, Sox2, Klf4, and A/yc. 
See also Figures S4-S7 and Table S5. 
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by the exogenous OCT4 protein during cellular reprog- 
ramming. Our global gene-expression analyses revealed 
that OCT4 does not induce expression of pluripotency- 
related genes in somatic cells during the first 24 hr after 
its overexpression. On the contrary, many ESC-expressed 
genes are downregulated by OCT4 in somatic cells. In 
addition, our data demonstrate that OCT4 activates and 
inhibits different genes in a cell-type-dependent manner. 
The OCT4-upregulated genes are highly variable from one 
cell type to another and do not present any common 
ontological pattern. Some genes were upregulated in 
one cell type while being downregulated in other cell 
types. Thus, our results suggest that OCT4 alone can 



induce a general perturbation in the somatic cell tran- 
scriptional network at the early steps of reprogramming, 
partially through the downregulation of cell-type-specific 
genes. Indeed, OCT4 has been reported to not only 
activate pluripotent genes but also repress developmental 
regulators in ESCs (Boyer et al, 2005; Rodda et al, 2005). 
However, the sequences to which OCT4 is bound in 
ESCs do not maintain the same accessibility in different 
types of somatic cells (Koche et al., 2011; Soufi et al., 
2012). Therefore, different cell-type-dependent chro- 
matin signatures could explain the differences in gene 
expression between MEFs, NSCs, and BMCs after Oct4 
induction. 
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Figure 7. 0CT4 and KLF4 Bind to the 
Mgarp Genomic Locus 

(A) Position of 13 ChlP-qPCR ampLicons (a- 
m) spanning approximateLy 5 kb up- and 
downstream of the Mgarp transcriptional 
start site. BLue bars indicate the covered 
regions, as estimated by the average size of 
crossLinked DNA fragments. The asterisk 
indicates position of a putative 0CT4 bind- 
ing motif within the first intron of Mgarp; 
double asterisks indicate position of a pu- 
tative KLF4 binding motif within the Mgarp 
promoter. 

(B) ChlP-qPCR analysis after a-0CT4 immu- 
noprecipitation of crossLinked DNA isolated 
from 0ct4-overexpressing MEFs. Relative 
occupancies to a negative control region 
(neg. ctrL, rDNA-28S). a-m, ampLicons as 
specified in (A); pos. ctrL, 0CT4 binding site 
positive control within regulatory region of 
0ct4. 

(C) ChlP-qPCR analysis after a-KLF4 immu- 
noprecipitation of crosslinked DNA isolated 
from /C//4-overexpressing MEFs. pos. Ctrl., 
KLF4 binding site positive control within 
regulatory region of Cdhl. The error bars in 
(B) and (C) represent SEM (technical repli- 
cates, n = 3). 

(D and E) Conservation of the identified 
0CT4 (D) and KLF4 (E) motif in different 
mammalian genomes. 
See also Tables S5 and S6. 



Fibroblasts can be directly reprogrammed into neurons 
(Vierbuchen et al., 2010), cardiomyocytes (leda et al, 

2010) , and NSCs (Han et al., 2012) using tissue-specific 
combinations of transcription factors. However, the plurip- 
otent reprogramming cocktail of Oct4, Sox2, Klf4, and Myc 
can also convert fibroblasts into cardiomyocytes (Efe et al, 

2011) and NSCs (Kim et al., 2011; Thier et al, 2012) under 
suitable culture conditions. In addition, the overexpression 
of only Oct4 has been reported to reprogram fibroblasts 
toward the hematopoietic lineage (Szabo et al., 2010). In 
these studies, the overexpression of Oct4 alone or in combi- 
nation with other transcription factors is thought to force 
the cells into an unstable and plastic intermediate (Orkin 
and Hochedlinger, 2011) that can be pushed toward the 
desired final cell type by specific environmental cues. We 



speculate that the initial stochastic phase of the reprogram- 
ming process (Buganim et al., 2012; Hanna et al., 2009) 
involves a transient transcriptionally unstable state that 
may be a prerequisite to the hierarchical events that take 
place during the late steps of reprogramming (Buganim 
et al., 2012). In addition, the reprogramming transgenes 
are usually expressed at higher levels during reprogram- 
ming than their corresponding endogenous counterparts 
in ESCs. These nonphysiological conditions may promote 
nonspecific binding to low-affinity binding sites and 
contribute to transcriptional chaos and instability. Our 
data suggest that Oct4 overexpression at early stages of 
reprogramming plays a role in the initiation of such an 
unstable intermediate through the inhibition of the so- 
matic cell-type-specific program rather than through the 
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activation of the pluripotent transcriptional network. Our 
results are consistent with previous four factor studies 
showing that fibroblast-specific genes are downregulated 
at early stages and that pluripotency-related genes are not 
upregulated until the late stages of reprogramming (Bram- 
brink et al, 2008; Stadtfeld et al., 2008). Although MYC was 
suggested to be the reprogramming factor primarily 
responsible for suppressing the cell-type-specific program 
(Sridharan et al, 2009), our data show that OCT4 alone 
can also inhibit somatic transcriptional networks. Finally, 
current available data suggest that OCT4, SOX2, and 
KLF4 act synergistically, as OCT4 heterodimerizes with 
SOX2 in order to maintain ESC self-renewal (Boyer et al., 
2005) and OCT4, SOX2, and KLF4 co-occupy the pro- 
moters of many reprogramming-related genes (Soufi 
et al., 2012; Sridharan et al, 2009). In contrast, we show 
that OCT4 alone is able to initiate certain steps in reprog- 
ramming, such as the activation of Mgarp, and that OCT4 
and KLF4 play an antagonistic role on Mgarp transcrip- 
tional regulation. 

Whereas each cell type exhibited a unique pattern of up- 
and downregulated genes, four genes were upregulated by 
OCT4 in all examined somatic cell types, including Mgarp. 
Our results show that the process of inducing de novo 
pluripotency does not simply consist of the activation of 
ESC-specific genes to the levels present in ESCs. Instead, 
some genes such as Mgarp need to be first downregulated 
and subsequently upregulated during the reprogramming 
process. We have elucidated the mechanism underlying 
this counterintuitive temporal expression pattern. In fact, 
KLF4 alone completely abolishes Mgarp expression whereas 
OCT4 alone induces high levels of Mgarp. This competitive 
interplay ensures that appropriate expression levels of 
Mgarp are maintained at different time points, which appear 
to be crucial for a successful reprogramming process, as 
either the permanent inhibition or the premature activa- 
tion of Mgarp prevents the efficient generation of iPSCs. 
To our knowledge, Mgarp is the first gene described to date 
upon which KLF4 and OCT4 exhibit antagonistic effects. 
The mechanism that regulates the switch from KLF4- to 
OCT4-regulated Mgarp expression remains to be identified. 

In summary, we have dissected the specific role that 
OCT4 plays during the first steps of reprogramming inde- 
pendently of the other reprogramming factors. We have 
shown that OCT4 does not require SOX2 to interfere 
with cell-type-specific gene-expression profiles or to 
initiate an unstable transcriptional state that facilitates re- 
programming. Finally, we have discovered a mechanism 
by which OCT4 and KLF4 compete to differentially regu- 
late Mgarp expression, resulting in distinct expression 
levels at different stages of reprogramming. Therefore, 
our study sheds new light on the function of OCT4 in 
the reprogramming process. 



EXPERIMENTAL PROCEDURES 

Cell Derivation and Culture 

Different somatic cell types were derived from transgenic mice 
containing a tetracycline-inducible transactivator (rtTA-M2), a 
TO (Hochedlinger et al., 2005), and a GOF18 Ocf4-GFP transgene 
in which the GFP is driven by 18 kb of the Oct4 regulatory region 
(Yoshimizu et al., 1999). In addition, a control cell line was gener- 
ated from transgenic mice, comprising a tetracycline-inducible 
transactivator (irtTA-VP16-GBD; Anastassiadis et al., 2002) plus 
an OG2 Ocf4-GFP reporter transgene (Yoshimizu et al., 1999) but 
lacking a TO cassette. MEFs and NSCs were derived from 14.5 
dpc embryos and cultured as previously described (Conti et al., 
2005). BMCs were isolated from 6-week-old mice after flushing 
the femora and tibiae and plated onto dishes with a combined 
coating of gelatin (PAA Laboratories), laminin (Santa Cruz Biotech- 
nology), and poly-L-lysine (Sigma), and cultivated in Dulbecco's 
modified Eagle's medium (DMEM) Low Glucose (Gibco) with 
10% fetal bovine serum, 100 U/ml penicillin, 100 [ig/ml strepto- 
mycin, and 2 mM/1 glutamine (all PAA). The transgenic Oct4 
was induced by either 6 jig/ml doxycycline (Sigma) or 2 )ig/ml 
doxycycline plus 10~^ M dexamethasone (Sigma), depending on 
the respectively contained tetracycline transactivator. 5-azacyti- 
dine (10 nM; Sigma) was applied as indicated. Generation, culture, 
and differentiation of ESCs and iPSCs were performed as previously 
described (Tiemann et al., 2011). Animal handling was in accor- 
dance with the Max Planck Institute animal protection guidelines 
and the German animal protection laws. 

Viral Vectors 

Production and transduction of viral vectors were performed as 
reported (Han et al., 2010). Retroviral pMXs vectors encoding 
for Ocf4, Sox2, Klf4, and Myc (Takahashi and Yamanaka, 2006) 
as well as the lentiviral vectors TetOP-CMV-Ocf4 and M2-rtTA tetra- 
cycline-transactivator (Stadtfeld et al., 2008) were purchased from 
Addgene (plasmids numbers 13366, 13367, 13370, 13375, 19766, 
and 20342). Mgarp cDNA was amplified and cloned into the pMXs 
and TetOP-CMV backbones. shRNA constructs targeting Mgarp 
were cloned into the lentiviral vectors pLVTHM (Wiznerowicz 
and Trono, 2003; Addgene number 12247) and Tet-pLKO-puro 
(Wee et al., 2008; Addgene number 21915). A iC/f4-2a-tomato over- 
expression cassette was also cloned into the pLVTHM backbone 
(Han et al, 2011). 

iPSC Characterization 

Immunocytochemistry, immunoblotting, ALP staining, methyl- 
ation analysis, and blastocyst injection were performed as pre- 
viously described (Han et al., 2011, 2013; Tiemann et al., 2011). 
Relative transcript expression levels were calculated based on 
the AACt method after normalization to Gapdh and Acth house- 
keeping controls (Kim et al., 2008). Primers and antibodies are 
listed in Tables S5 and S6, respectively. 

Measurement of Mitochondrial Activity and Copy 
Number 

Mitochondria were stained for 30 min with 30 nM tetramethylr- 
hodamine methyl ester (Invitrogen) and quantified on a BD 
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FACSAria II flow cytometer (Beet on Dickinson). Copy numbers 
of mitochondrial DNA were assessed by quantitative PGR (Kelly 
et al., 2013). 

Chip 

MEFs overexpressing Oct4 (TO-MEFs after doxycycline treatment) 
or Klf4 (MEFs sorted for tomato expression after transduction 
with LVTHM-X/f4-2a-tomato) were used for ChlP. Cell cultures 
were crosslinked with 1% formaldehyde for 10 min at room tem- 
perature and sonicated using a Bioruptor device (Diagenode). 
Chromatin was immunoprecipitated with the corresponding 
antibodies, and the precipitated DNA was analyzed by qRT- 
PCR. Primers and antibodies are listed in Tables S5 and S6, 
respectively. 

Microarray Gene-Expression Analysis 

Sample preparation and analysis was performed as previously 
described (Tapia et al., 2012). Score values representing up- and 
downregulated genes were calculated from the processed raw 
data as follows (Figure S2). Signal changes for each probe and 
each cell type were calculated by subtraction of the logarithmic 
values between 0 hr and 24 hr (Atotai)- To reduce the impact of 
high relative changes in noisy background expression, the signal 
change was set to zero if the signal intensity never reached a 
threshold of six. Next, two correction parameters were introduced. 
The first parameter compensates for signal changes that do not 
show a gradual tendency throughout the time course. To this 
end, subtraction values between 6 and 0 hr, 12 and 6 hr, and 24 
and 12 hr were calculated only for those signal intensities greater 
than six. If any of these intermediate steps presents a sign (positive 
or negative) that differs from that of the overall trend, this value is 
subtracted from the Atotai value. The second correction parameter 
compensates for signal changes that are observed not only in the 
sample cell type but also in one or both controls. In this case, the 
subtraction value between 24 and 0 hr of the control cell line 
that presents the most similar behavior to the studied sample 
was subtracted from Atotai- Finally, the sign of the score after sub- 
tracting the two correction parameters cannot become different 
from the original sign of Atotai- Probes with a score higher than 1 
or lower than -1 were considered up- or downregulated, respec- 
tively, which corresponds to more than a 2-fold total change in 
expression. 

Gene ontology analyses were performed using the Functional 
Annotation tool of the DAVID bioinformatics database (Huang 
da et al., 2009). 
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